Toward a Task-based Gold Standard for Evaluation of NP Chunks and Technical Terms

نویسندگان

  • Nina Wacholder
  • Peng Song
چکیده

We propose a gold standard for evaluating two types of information extraction output -noun phrase (NP) chunks (Abney 1991; Ramshaw and Marcus 1995) and technical terms (Justeson and Katz 1995; Daille 2000; Jacquemin 2002). The gold standard is built around the notion that since different semantic and syntactic variants of terms are arguably correct, a fully satisfactory assessment of the quality of the output must include task-based evaluation. We conducted an experiment that assessed subjects’ choice of index terms in an information access task. Subjects showed significant preference for index terms that are longer, as measured by number of words, and more complex, as measured by number of prepositions. These terms, which were identified by a human indexer, serve as the gold standard. The experimental protocol is a reliable and rigorous method for evaluating the quality of a set of terms. An important advantage of this task-based evaluation is that a set of index terms which is different than the gold standard can ‘win’ by providing better information access than the gold standard itself does. And although the individual human subject experiments are time consuming, the experimental interface, test materials and data analysis programs are completely re-usable.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Authenticity Evaluation of TOEFL iBT Speaking Module from the Perspective of Applied Linguistics and General Education

For the first time, this study combined models and principles of authentic assessment from two parallel fields of applied linguistics as well as general education to investigate the authenticity of the TOEFL iBT speaking module. The study consisted of two major parts, namely task analysis and task survey. Utilizing Bachman and Palmer’s (1996) definition of authenticity, the task analysis examin...

متن کامل

SemEval-2010 Task 11: Event Detection in Chinese News Sentences

The goal of the task is to detect and analyze the event contents in real world Chinese news texts. It consists of finding key verbs or verb phrases to describe these events in the Chinese sentences after word segmentation and part-of-speech tagging, selecting suitable situation descriptions for them, and anchoring different situation arguments with suitable syntactic chunks in the sentence. Thr...

متن کامل

The Impact of Task-based Language Teaching on ESP Learners’ Productive Skills: From Task-based Instruction to Investigation of Learners’ and Instructors’ Attitudes toward the Course

Togetherness of English for Specific Purposes (ESP) and Task-Based Language Teaching (TBLT) has been the subject of many recent studies in English as a Foreign Language (EFL) and English as a Second Language (ESL) domain. Few studies, however, have addressed the impact of TBLT on ESP learners’ linguistic production. This study aimed at investigating the impact of task-based teaching on ESP lear...

متن کامل

The Impact of Teaching Chunks on Speaking Fluency of Iranian EFL Learners

Research on multiword clusters (chunks) is based on the assumption that native speakers use plenty of chunks in their everyday language and they are considered as fluent speakers of language. Therefore the present study was an attempt to investigate the impact of using chunks on speaking fluency of Iranian EFL learners. In the first phase of the study, the students of two intermediate classes s...

متن کامل

Algorithms for Minimum Risk Chunking

Stochastic finite automata are useful for identifying substrings (chunks) within larger units of text. Relevant applications include tokenization, base-NP chunking, named entity recognition, and other information extraction tasks. For a given input string, a stochastic automaton represents a probability distribution over strings of labels encoding the location of chunks. For chunking and extrac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003